Foldering voicemail messages by caller using text independent speaker recognition

Authors

  • Aaron E. Rosenberg
  • Sarangarajan Parthasarathy
  • Julia Hirschberg
  • Stephen Whittaker
Abstract

The ability to automatically scan voicemail messages for content and caller identity cues would be a useful service. This paper describes a system which automatically files voicemail messages into caller folders using text independent speaker recognition techniques. Callers are represented by Gaussian mixture models (GMM's). The speech for an incoming message is processed and scored against caller models created for a subscriber. A message whose matching score exceeds a threshold is filed in the matching caller folder; otherwise it is tagged as "unknown". The subscriber has the ability to listen to an "unknown" message and file it in the proper folder, if it exists, or create a new folder, if it does not. Such subscriber labelled messages are used to train and adapt caller models. The system has been evaluated on a database of voicemail messages collected at AT&T Labs. A set of 20 callers from this database is designated as "ingroup". Each of these callers has recorded at least 20 messages totalling 10 or more minutes in duration. A distinct set of 220 messages, each from a different caller, are designated as "outgroup". Representative performance figures with threshold parameters set to ensure that outgroup acceptance is low compared with ingroup rejection are the following. The average ingroup message rejection rate is 11.0% and the average ingroup message confusion rate (matching the wrong caller) is 1.0%, while the average outgroup message accept rate is 2.7%.
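The filing rule and the three reported error rates can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes diagonal-covariance GMMs, takes the matching score to be the average per-frame log-likelihood (the abstract does not say how the score is computed or normalized), and pools the rates over messages rather than averaging per caller. The names `diag_gmm_avg_loglik`, `file_message`, `evaluate`, the toy models and the threshold value are all hypothetical.

```python
import math

def diag_gmm_avg_loglik(frames, weights, means, variances):
    """Average per-frame log-likelihood under a diagonal-covariance GMM."""
    total = 0.0
    for x in frames:
        comp_logs = []
        for w, mu, var in zip(weights, means, variances):
            ll = math.log(w)
            for xi, mi, vi in zip(x, mu, var):
                ll -= 0.5 * (math.log(2.0 * math.pi * vi) + (xi - mi) ** 2 / vi)
            comp_logs.append(ll)
        # log-sum-exp over mixture components, for numerical stability
        m = max(comp_logs)
        total += m + math.log(sum(math.exp(c - m) for c in comp_logs))
    return total / len(frames)

def file_message(frames, caller_models, threshold):
    """Return the best-matching caller folder, or "unknown" if the best
    score does not exceed the threshold (assumed scoring rule)."""
    best_name, best_score = None, -math.inf
    for name, (weights, means, variances) in caller_models.items():
        score = diag_gmm_avg_loglik(frames, weights, means, variances)
        if score > best_score:
            best_name, best_score = name, score
    return best_name if best_score > threshold else "unknown"

def evaluate(decisions):
    """decisions: list of (true_caller, assigned_folder); true_caller is
    None for outgroup messages. Returns pooled (ingroup rejection rate,
    ingroup confusion rate, outgroup accept rate)."""
    ingroup = [(t, a) for t, a in decisions if t is not None]
    outgroup = [a for t, a in decisions if t is None]
    rejection = sum(a == "unknown" for _, a in ingroup) / len(ingroup)
    confusion = sum(a not in ("unknown", t) for t, a in ingroup) / len(ingroup)
    accept = sum(a != "unknown" for a in outgroup) / len(outgroup)
    return rejection, confusion, accept

# Toy single-component models in a 2-D feature space (hypothetical values).
models = {
    "alice": ([1.0], [[0.0, 0.0]], [[1.0, 1.0]]),
    "bob": ([1.0], [[5.0, 5.0]], [[1.0, 1.0]]),
}
THRESHOLD = -4.0  # hypothetical; in practice tuned on held-out messages

print(file_message([[0.1, -0.2], [0.0, 0.3]], models, THRESHOLD))
print(file_message([[20.0, 20.0]], models, THRESHOLD))
```

Raising the threshold in this sketch trades a lower outgroup accept rate for a higher ingroup rejection rate, which is the trade-off the reported operating point reflects.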


Similar resources

Caller identification for the SCANMail voicemail browser

SCANMail is a prototype system developed at AT&T Labs for the purpose of providing useful tools for managing and searching through voicemail messages. Content is extracted from voicemail messages using various speech and text processing tools. One such content category is the identity of the message caller. This paper describes CallerID, the server tool attached to SCANMail for the purpose of p...


SCANMail: Audio Navigation in the Voicemail Domain

This paper describes SCANMail, a system that allows users to browse and search their voicemail messages by content through a GUI. Content based navigation is realized by use of automatic speech recognition, information retrieval, information extraction and human computer interaction technology. In addition to the browsing and querying functionalities, acoustics-based caller ID technology is use...


A Speaker Identification Agent

This paper describes a prototype application which combines speaker identification technology and an agent architecture to provide user-definable monitors for incoming voicemail messages. Through a Web-distributable Java user interface, the user may enter requests by using spoken or typed natural language. Multiple distributed agents process the requests, periodically testing the user's voicemail...


A study of adaptation techniques on a voicemail transcription task

Speaker adaptation techniques have emerged as very effective and practical methods to improve ASR performance on a test speaker with only limited speech data from the speaker. We explore the use of adaptation techniques on a new Voicemail database and present some theoretical extensions of the Cluster Transformation (CT) technique. Our experiments on 40 hours of voicemail data and four clusters...


Information Extraction from Voicemail

In this paper we address the problem of extracting key pieces of information from voicemail messages, such as the identity and phone number of the caller. This task differs from the named entity task in that the information we are interested in is a subset of the named entities in the message, and consequently, the need to pick the correct subset makes the problem more difficult. Also, the call...




Publication date: 2000